Robust optimal classification trees under noisy labels

نویسندگان

چکیده

Abstract In this paper we propose a novel methodology to construct Optimal Classification Trees that takes into account noisy labels may occur in the training sample. The motivation of new is based on superaditive effect combining together margin classifiers and outlier detection techniques. Our approach rests two main elements: (1) splitting rules for classification trees are designed maximize separation between classes applying paradigm SVM; (2) some sample allowed be changed during construction tree trying detect label noise. Both features considered integrated design resulting Tree. We present Mixed Integer Non Linear Programming formulation problem, suitable solved using any available off-the-shelf solvers. model analyzed tested battery standard datasets taken from UCI Machine Learning repository, showing effectiveness our approach. computational results show most cases outperforms both accuracy AUC benchmarks provided by OCT OCT-H.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning with Confident Examples: Rank Pruning for Robust Classification with Noisy Labels

P̃ Ñ learning is the problem of binary classification when training examples may be mislabeled (flipped) uniformly with noise rate ρ1 for positive examples and ρ0 for negative examples. We propose Rank Pruning (RP) to solve P̃ Ñ learning and the open problem of estimating the noise rates. Unlike prior solutions, RP is efficient and general, requiring O(T ) for any unrestricted choice of probabili...

متن کامل

Learning of Classification Models from Noisy Soft-Labels

We develop and test a new classification model learning algorithm that relies on the soft-label information and that is able to learn classification models more rapidly and with a smaller number of labeled instances than existing approaches.

متن کامل

Learning with Noisy Labels

In this paper, we theoretically study the problem of binary classification in the presence of random classification noise — the learner, instead of seeing the true labels, sees labels that have independently been flipped with some small probability. Moreover, random label noise is class-conditional — the flip probability depends on the class. We provide two approaches to suitably modify any giv...

متن کامل

Robust Speech Recognition Under Noisy Ambient Conditions

Automatic speech recognition is critical in natural human-centric interfaces for ambient intelligence. The performance of an automatic speech recognition system, however, degrades drastically when there is a mismatch between training and testing conditions. The aim of robust speech recognition is to overcome the mismatch problem so the result is a moderate and graceful degradation in recognitio...

متن کامل

Robust Inverse Covariance Estimation under Noisy Measurements

This paper proposes a robust method to estimate the inverse covariance under noisy measurements. The method is based on the estimation of each column in the inverse covariance matrix independently via robust regression, which enables parallelization. Different from previous linear programming based methods that cannot guarantee a positive semi-definite covariance matrix, our method adjusts the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Advances in data analysis and classification

سال: 2021

ISSN: ['1862-5355', '1862-5347']

DOI: https://doi.org/10.1007/s11634-021-00467-2